Word Sense Disambiguation and Text Segmentation Based on Lexical Cohesion

Authors

  • Manabu Okumura
  • Takeo Honda
Abstract

In this paper, we describe how word sense ambiguity can be resolved with the aid of lexical cohesion. By checking lexical cohesion between the current word and lexical chains in the order of the salience, in tandem with generation of lexical chains, we realize incremental word sense disambiguation based on contextual information that lexical chains reveal. Next, we describe how segment boundaries of a text can be determined with the aid of lexical cohesion. We can measure the plausibility of each point in the text as a segment boundary by computing a degree of agreement of the start and end points of lexical chains.
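
The boundary measure described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the scoring used here (the number of chains that end just before a gap plus the number that start just after it) is an assumption, as are the function name `boundary_scores`, the representation of a chain as a `(start, end)` pair of sentence indices, and the toy data.

```python
from collections import Counter


def boundary_scores(chains, num_sentences):
    """Score every gap between sentence i and sentence i + 1 (0-based).

    chains: list of (start, end) sentence indices, both inclusive.
    Returns a list of length num_sentences - 1. scores[i] counts the chains
    that end at sentence i plus the chains that start at sentence i + 1.
    This sum is an assumed stand-in for the paper's "degree of agreement".
    """
    ends = Counter(end for _, end in chains)
    starts = Counter(start for start, _ in chains)
    return [ends[i] + starts[i + 1] for i in range(num_sentences - 1)]


# Hypothetical lexical chains over a six-sentence text.
chains = [(0, 2), (1, 2), (3, 5), (3, 4), (0, 5)]
scores = boundary_scores(chains, 6)

# The gap with the highest score is the most plausible segment boundary.
best_gap = max(range(len(scores)), key=scores.__getitem__)
print(scores, best_gap)
```

In this toy input, two chains end at sentence 2 and two start at sentence 3, so the gap between them receives the highest score. A chain that spans the whole text, such as `(0, 5)`, contributes to no gap.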


Similar resources

Modelling Lexical Stress

Human listeners use lexical stress for word segmentation and disambiguation. We look into using lexical stress for speech recognition by examining a Dutch-language corpus. We propose that different spectral features are needed for different phonemes and that, besides vowels, consonants should be taken into account.


Word Sense Disambiguation Using Lexical Cohesion in the Context

This paper designs a novel lexical hub to disambiguate word sense, using both syntagmatic and paradigmatic relations of words. It only employs the semantic network of WordNet to calculate word similarity, and the Edinburgh Association Thesaurus (EAT) to transform contextual space for computing syntagmatic and other domain relations with the target word. Without any back-off policy the result on...


UofL: Word Sense Disambiguation Using Lexical Cohesion

One of the main challenges in applications of Natural Language Processing (e.g., text summarization, question answering, information retrieval) is to determine which of the several senses of a word is used in a given context. The problem is known as “Word Sense Disambiguation (WSD)” in the NLP community. This paper presents the dictionary-based disambiguation technique that adopts t...


Lexical stress in continuous speech recognition

Human listeners use lexical stress for word segmentation and disambiguation. We look into using lexical stress for large-vocabulary speech recognition for the Dutch language. It appears that, besides vowels, consonants should be taken into account. By introducing stressed phonemes, and features for spectral bands and the fundamental frequency, we reduce the word error rate by 2.6%.


Chinese Lexical Analysis Using Hierarchical Hidden Markov Model

This paper presents a unified approach for Chinese lexical analysis using a hierarchical hidden Markov model (HHMM), which aims to incorporate Chinese word segmentation, part-of-speech tagging, disambiguation, and unknown word recognition into a single theoretical framework. A class-based HMM is applied in word segmentation, and at this level unknown words are treated in the same way as common words l...




Publication date: 1994